Speaker orientation estimation based on hybridation of GCC-PHAT and HLBR
Authors
Abstract
This paper presents a novel approach to speaker orientation estimation in a smart-room environment equipped with multiple microphones. The ratio between the high-band and low-band energies (HLBR) received at each microphone was shown in our previous work to be a potentially useful cue for estimating the direction of the voice produced by a speaker. In this work, for each microphone pair, a smoothed cross-power spectrum (CPS) phase is obtained by suitably windowing the main peak of the cross-correlation sequence estimated with the GCC-PHAT method, and an HLBR is computed from the processed CPS. The proposed method keeps the computational simplicity of the HLBR algorithm while adding the robustness offered by the GCC-PHAT technique. Preliminary experiments were conducted on a database recorded for this purpose in the UPC smart room and on the CLEAR head pose database. The proposed method performs consistently better than other state-of-the-art techniques on both databases.
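The per-pair processing chain described in the abstract (GCC-PHAT, windowing of the main cross-correlation peak, then a high/low band energy ratio on the processed CPS) can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the function names, the Hann-shaped taper, the `half_width` value, and the band edges in `low_band` and `high_band` are all hypothetical placeholders chosen for illustration, since the abstract does not specify them.

```python
import numpy as np

def gcc_phat(x1, x2, n_fft):
    """Cross-correlation of x1 and x2 with PHAT (phase transform) weighting."""
    X1 = np.fft.rfft(x1, n_fft)
    X2 = np.fft.rfft(x2, n_fft)
    cps = X1 * np.conj(X2)                # cross-power spectrum (CPS)
    cps = cps / (np.abs(cps) + 1e-12)     # PHAT: discard magnitude, keep phase
    return np.fft.irfft(cps, n_fft)

def window_main_peak(cc, half_width):
    """Keep only a tapered neighbourhood of the main peak (assumed Hann shape)."""
    n = len(cc)
    peak = int(np.argmax(cc))
    # Signed circular distance of every lag index from the peak index.
    offsets = (np.arange(n) - peak + n // 2) % n - n // 2
    taper = 0.5 * (1.0 + np.cos(np.pi * offsets / (half_width + 1)))
    window = np.where(np.abs(offsets) <= half_width, taper, 0.0)
    return cc * window, peak

def pair_hlbr(x1, x2, fs, half_width=8,
              low_band=(300.0, 1000.0), high_band=(3000.0, 7000.0)):
    """Return (high/low band energy ratio, peak lag in samples) for one pair.

    half_width, low_band and high_band are illustrative assumptions,
    not values taken from the paper.
    """
    n_fft = 2 * max(len(x1), len(x2))     # zero-pad to avoid circular wrap
    cc = gcc_phat(x1, x2, n_fft)
    cc_win, peak = window_main_peak(cc, half_width)
    cps_smooth = np.fft.rfft(cc_win)      # smoothed (processed) CPS
    power = np.abs(cps_smooth) ** 2
    freqs = np.fft.rfftfreq(n_fft, d=1.0 / fs)
    low = power[(freqs >= low_band[0]) & (freqs < low_band[1])].sum()
    high = power[(freqs >= high_band[0]) & (freqs < high_band[1])].sum()
    lag = peak if peak < n_fft // 2 else peak - n_fft
    return high / (low + 1e-12), lag
```

In this sketch the returned lag is positive when `x1` is a delayed copy of `x2`. How the per-pair ratios are combined into an orientation estimate is not described in the abstract and is left out here.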
Similar resources
GCC-PHAT based Head Orientation Estimation
This work presents a novel two-step algorithm to estimate the orientation of speakers in a smart-room environment equipped with microphone arrays. First, the position of the speaker is estimated by the SRP-PHAT algorithm, and the time delay of arrival for each microphone pair with respect to the detected position is computed. In the second step, the value of the cross-correlation at the estimated...
Experimental evaluation of multi-band position-pitch estimation (m-popi) algorithm for multi-speaker localization
This paper proposes an enhancement and evaluates the performance of the joint position and pitch estimation (PoPi) algorithm for speaker localization. A modification in the algorithm is introduced in order to improve the performance under high reverberation levels. The performance of the proposed method is evaluated by measuring the correct estimate of position at a frame level. This evaluation...
Speaker Localization Using Two-Channel Microphone on the SIG-2 Humanoid Robot
Speaker localization is one of the most important techniques for achieving natural and intelligent human-robot interaction (HRI), because robots need to 1) identify the direction of a talker from the acoustic signals measured by microphones, and 2) look toward the position of a talker to signal that they are ready to receive an order or to express their interest in conversation. Mo...
Effect of head orientation on the speaker localization performance in smart-room environment
Reliable measures of speaker positions are needed for computational perception of human activities taking place in a smart-room environment. In this work, we investigate the effect of the talker's head orientation on the accuracy of acoustic source localization techniques and its relation to the talker directivity pattern and room reverberation. Two different representative speaker localization ...
Performance Improvement of TDOA-Based Speaker Localization in Joint Noisy and Reverberant Conditions
Time difference of arrival (TDOA) based algorithms are common methods for speech source localization. The generalized cross correlation (GCC) method is the most important approach for estimating the TDOA between microphone pairs. The performance of this method degrades significantly in the presence of noise and reverberation. This paper addresses the problem of 3D localization in joint noisy and re...
Publication date: 2008